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Abstract. The existing call-by-need A calculi describe lazy evaluation 
via equational logics. A programmer can use these logics to safely as- 
certain whether one term is behaviorally equivalent to another or to 
determine the value of a lazy program. However, neither of the existing 
calculi models evaluation in a way that matches lazy implementations. 

Both calculi suffer from the same two problems. First, the calculi never 
discard function calls, even after they are completely resolved. Second, 
the calculi include re-association axioms even though these axioms are 
merely administrative steps with no counterpart in any implementation. 

In this paper, we present an alternative axiomatization of lazy evalu- 
ation using a single axiom. It eliminates both the function call retention 
problem and the extraneous re-association axioms. Our axiom uses a 
grammar of contexts to describe the exact notion of a needed compu- 
tation. Like its predecessors, our new calculus satisfies consistency and 
standardization properties and is thus suitable for reasoning about be- 
havioral equivalence. In addition, we establish a correspondence between 
our semantics and Launchbury's natural semantics. 

Keywords: call-by-need, laziness, lambda calculus 



1 A Short History of the A Calculus 

Starting in the late 1950s, programming language researchers began to look 
to Church's A calculus for inspiration. Some used it as an analytic tool to 
understand the syntax and semantics of programming languages, while others 
exploited it as the basis for new languages. By 1970, however, a disconnect had 
emerged in the form of call-by-value programming, distinct from the notion of 
j3 and normalization in Church's original calculus. Plotkin [25] reconciled the 
A calculus and Landin's SECD machine for the ISWIM language [13 with the 
introduction of a notion of correspondence and with a proof that two distinct 
variants of the A calculus corresponded to two distinct variants of the ISWIM 
programming language: one for call- by- value and one for call-by-name. 

In the early 1970s, researchers proposed call-by-need [121114128]. a third kind 
of parameter passing mechanism that could be viewed as yet another variant 
of the ISWIM language. Call-by-need is supposed to represent the best of both 
worlds. While call- by- value ISWIM always evaluates the argument of a function, 
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the call-by-name variant evaluates the argument every time it is needed. Hence, 
if an argument (or some portion) is never needed, call- by-name wins; otherwise 
call-by-value is superior because it avoids re-evaluation of arguments. Call-by- 
need initially proceeds like call-by-name, evaluating a function's body before 
the argument — until the value of the argument is needed; at that point, the 
argument is evaluated and the resulting value is used from then onward. In 
short, call-by-need evaluates an argument at most once, and only if needed. 

Since then, researchers have explored a number of characterizations of call- by- 
need [811 11131151231241^5] . Concerning this paper, three stand out. Launchbury's 
semantics |17j specifies the meaning of complete programs with a Kahn-style 
natural semantics. The call- by-need A calculi of Ariola and Felleisen |2I3I4] . and 
of Maraist, Odersky, and Wadler [4120121] are equational logics in the spirit of 
the A calculus. 

The appeal of the A calculus has several reasons. First, a calculus is sound 
with respect to the observational (behavioral) equivalence relation [35]. It can 
therefore serve as the starting point for other, more powerful logics. Second, 
its axioms are rich enough to mimic machine evaluation, meaning programmers 
can reduce programs to values without thinking about implementation details. 
Finally, the A calculus gives rise to a substantial meta-theory |5l7j from which 
researchers have generated useful and practical results for its cousins. 

Unfortunately, neither of the existing by-need calculi model lazy evaluation 
in a way that matches lazy language implementations. Both calculi suffer from 
the same two problems. First, unlike the by-name and by-value calculi, the by- 
need calculi never discard function calls, even after the call is resolved and the 
argument is no longer needed. Lazy evaluation does require some accumulation 
of function calls due to the delayed evaluation of arguments but the existing 
calculi adopt the extreme solution of retaining every call. Indeed, the creators of 
the existing calculi acknowledge that a solution to this problem would strengthen 
their work but they could not figure out a proper solution. 

Second, the calculi include re-association axioms even though these axioms 
have no counterpart in any implementation. The axioms are mere administrative 
steps, needed to construct /3-like redexes. Hence, they should not be considered 
computationally on par with other axioms. 

In this paper, we overcome these problems with an alternative axiomati- 
zation. Based on a single axiom, it avoids the retention of function calls and 
eliminates the extraneous re-association axioms. The single axiom uses a gram- 
mar of contexts to describe the exact notion of a needed computation. Like its 
predecessors, our new calculus satisfies consistency and standardization proper- 
ties and is thus suitable for reasoning about behavioral equivalence. In addition, 
we establish an intensional correspondence with Launchbury's semantics. 

The second section of this paper recalls the two existing by-need calculi in 
some detail. The third section presents our new calculus, as well as a way to 
derive it from Ariola and Felleisen's calculus. Sections @] and [5] show that our 
calculus satisfies the usual meta-theorems and that it is correct with respect to 
Launchbury's semantics. Finally, we discuss some possible extensions. 



The Call-by-need Lambda Calculus, Revisited 3 

2 The Original Call-by-need A Calculi 

The original call-by-need A calculi are independently due to two groups: Ariola 
and Felleisen [213] and Maraist, et al. |20|21] . They were jointly presented at 
POPL in 1995 [4 . Both calculi use the standard set of terms as syntax: 

e — x \ Xx.e \ ee (Terms) 

Our treatment of syntax employs the usual conventions, including Barcndregt's 
standard hygiene condition for variable bindings [5]. Figured] specifies the calcu- 
lus of Maraist et al., X mow , and A a /, Ariola and Felleisen's variant. Nonterminals 
in some grammar productions have subscript tags to differentiate them from 
similar sets elsewhere in the paper. Unsubscriptcd definitions have the same 
denotation in all systems. 



v m — x | Xx.e 

C = [ ] I Xx.C \Ce\eC 

{Xx.C[x]) v m = (Xx.C[v m ]) v m (V) 

(As.ei) e 2 e 3 = (Xx.e! e 3 ) e 2 (C) 

(Az.ei)((Ay.e 2 )e 3 ) = (.4) 
(Ay.(Ax.ei) e 2 ) e 3 

(Xx.ei) e 2 = ei,x $ fv(ei) (Q) 



v — Xx.e 
a af = v | (Xx.a a f) e 

Eaf = [ ] | E af e | (Xx.E af ) e | (Xx.E af [x}) E af 



(Xx.E af [x]) v = (Xx.E af [v]) v (deref) 
(Xx.a a} ) ei e 2 = (Xx.a a fe 2 ) e\ (lift) 
(Xx.E a f[x}) ((Xy.a a} ) e) = (assoc) 
{Xy.(Xx.E a f\x\) a a f) e 



Fig. 1. Existing call-by-need A calculi (left: X mow , right: X a f) 

In both calculi, the analog to the (3 axiom — also called a basic notion of re- 
duction [5] — replaces variable occurrences, one at a time, with the value of the 
function's argument. Value substitution means that there is no duplication of 
work as far as argument evaluation is concerned. The function call is retained 
because additional variable occurrences in the function body may need the ar- 
gument. Since function calls may accumulate, the calculi come with axioms that 
re-associate bindings to pair up functions with their arguments. For example, 
re-associating (Xx.(Xy.Xz.z) v y ) v x v z in A a f exposes a rfere/redex: 



lift lift 

(Xx.(Xy. Xz.z )v y )v x v z — > (Xx.(Xy. Xz.z ) v v v z ) v x — > (Xx.(Xy.(Xz.z)v z )v y )v x 



The two calculi differ from each other in their timing of variable replacements. 
The X mow calculus allows the replacement of a variable with its value anywhere in 
the body of its binding A. The A a / calculus replaces a variable with its argument 
only if evaluation of the function body needs it, where "need" is formalized via 
so-called evaluation contexts [E a f). Thus evaluation contexts in X a f serve the 
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double purpose of specifying demand for arguments and the standard reduction 
strategy. The term (Xx.Xy.x) v illustrates this difference between the two calculi. 
According to X mow , the term is a V redex and reduces to (Xx.Xy.v) v, whereas in 
X a [, the term is irreducible because the x occurs in an inner, unapplied A, and 
is thus not "needed." 

Also, X mow is more lenient than X a f when it comes to re-associations. The X a f 
calculus re-associates the left or right hand side of an application only if it has 
been completely reduced to an answer, but X mow permits re-association as soon 
as one nested function layer is revealed. In short, X mow proves more equations 
than X a f, i.e., A a / C 

In X a f, programs reduce to answers: 

eval /(e) = done iff there exists an answer a a f such that A n / he = a a f 

In contrast, Maraist et al. introduce a "garbage collection" axiom into X mow to 
avoid answers and to use values instead. This suggests the following definition: 

eval mow (e) = done iff there exists a value v m such that X mow ^ e = v m 

This turns out to be incorrect, however. Specifically, let eval„ arne be the analo- 
gous call-by-name evaluator. Then eval a y = eval„ ame but eval mow ^ eval„ ame . 
Examples such as (Xx.Xy.x) Q confirm the difference. 

In recognition of this problem, Maraist et al. use Ariola and Felleisen's ax- 
ioms and evaluation contexts to create their Curry-Feys-style standard reduction 
sequences. Doing so reveals the inconsistency of X mow with respect to Plotkin's 
correspondence criteria [25) . According to Plotkin, a useful calculus corresponds 
to a programming language, meaning its axioms (1) satisfy the Church-Rosser 
and Curry-Feys Standardization properties, and (2) define a standard reduction 
function that is equal to the evaluation function of the programming language. 
Both the call-by-name and the call-by- value A calculi satisfy these criteria with 
respect to call- by-name and call- by- value SECD machines for ISWIM, respec- 
tively. So does X a f with respect to a call-by-need SECD machine, but some of 
Amo-uj's axioms cannot be used as standard reduction relations. 

Finally, the inclusion of Q is a brute-force attempt to address the function call 
retention problem. Because Q may discard arguments even before the function is 
called, both sets of authors consider it too coarse and acknowledge that a tighter 
solution to the function call retention issue would "strengthen the calculus and 
its utility for reasoning about the implementations of lazy languages" [3] . 

3 A New Call-by-need A Calculus 

Our new calculus, X nee d, uses a single axiom, f3 nee d- The new axiom evaluates the 
argument when it is first demanded, replaces all variable occurrences with that 
result, and then discards the argument and thus the function call. In addition, 
the axiom performs the required administrative scope adjustments as part of the 
same step, rendering explicit re-association axioms unnecessary. In short, every 
reduction step in our calculus represents computational progress. 
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Informally, to perform a reduction, three components must be identified: 

1. the next demanded variable, 

2. the function that binds that demanded variable, 

3. and the argument to that function. 

In previous by-need calculi the re-association axioms rewrite a term so that the 
binding function and its argument are adjacent. 

Without the re-association axioms, finding the function that binds the de- 
manded variable and its argument requires a different kind of work. The following 
terms show how the demanded variable, its binding function, and its argument 
can appear at seemingly arbitrary locations in a program: 

• (Xx.(Xy.Xz.x) e y ) e x e z 

• (Xx.(Xy.Xz.y)ey) e x e z 

• (Xx.(Xy. Xz.z ) e„) e x e z 

Our f3 nee d axiom employs a grammar of contexts to describe the path from a 
demanded variable to its binding function and from there to its argument. 

The first subsection explains the syntax and the contexts of A nee d in a gradual 
fashion. The second subsection presents the fi nee d axiom and also shows how to 
derive it from Ariola and Felleisen's X a f calculus. 



3.1 Contexts 

Like the existing by-need calculi, the syntax of our calculus is that of Church's 
original calculus. In X nee d, calculations evaluate terms e to answers A[v], which 
generalize answers from Ariola and Felleisen's calculus: 

e = x | Xx.e | e e (Terms) 

v = Xx.e (Values) 

a = A[v] (Answers) 

A = [ ] A [Ax. A] e (Answer Contexts) 

Following Ariola and Felleisen, the basic axiom uses evaluation contexts to 
specify the notion of demand for variables: 

E =[]\Ee \ ... (Evaluation Contexts) 

The first two kinds, taken from A /, specify that a variable is demanded, and that 
a variable in the operator position of an application is demanded, respectively. 

Since the calculus is to model program evaluation, we are primarily interested 
in demanded variables under a A-abstraction. This kind of evaluation context is 
defined using an answer context A: 



E = ...\A[E}\ 



(Another Evaluation Context) 
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Using answer contexts, this third evaluation context dictates that demand exists 
under a A if a corresponding argument exists for that A. Note how an answer 
context descends under the same number of As as arguments for those As. In 
particular, for any term A[Xx.e\] e 2 , e 2 is always the argument of \x.e\. The 
third evaluation context thus generalizes the function-is-next-to-argument re- 
quirement found in both call-by-name and call-by-value. The generalization is 
needed due to the retention of function calls in X„ ee d- 

Here are some example answer contexts that might be used: 

A = (Xx.[ ])e x 



Ai = (Ax. (Xy.[ ]) e v ) e x 



A 2 = (Xx. (Ay. (Az.[ ]) e z ) e y ) e x 



An underbrace matches each function to its argument. The examples all juxta- 
pose functions and their arguments. In contrast, the next two separate functions 
from their arguments: 

A 3 = (Xx.Xy.Xz.[ ])e x e y e z 



A 4 = (Xx.(Xy.Xz.[ ]) e y ) e x e z 



To summarize thus far, when a demanded variable is discovered under a A, 
the surrounding context looks like this: 

A[E[x\] 

where both the function binding x and its argument are in A. The decomposition 
of the surrounding context into A and E assumes that A encompasses as many 
function-argument pairs as possible; in other words, it is impossible to merge 
the outer part of E with A to form a larger answer context. 

To know which argument corresponds to the demanded variable, we must 
find the A that binds x in A. To this end, we split answer contexts so that we 
can "highlight" a function- argument pair within the context: 

A = [ ] | A[A] e (Partial Answer Contexts-Outer) 

V V 

A = [ ] | A[Aa;.A] (Partial Answer Contexts-Inner) 

Using these additional contexts, any answer context can be decomposed into 



A[A[Xx.A[ ]] e] 
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where e is the argument of Ax.A[ ]. For a fixed function-argument pair in an 

A V A 

answer context, this partitioning into A, A, and A is unique. The A subcontext 
represents the part of the answer context around the chosen function-argument 

V 

pair; the A subcontext represents the part of the answer context in its body; and 
A here is the subcontext between the function and its argument. Naturally we 

A V 

must demand that A composed with A is an answer context as well so that the 
overall context remains an answer context. The following table lists the various 
subcontexts for the example A 4 for various function-argument pairs: 





A 4 = 


{Xx.{Xy.Xz.[ ])e y )e x e z 


A 

A = 


[ ]e z 


(Xx.[ })e x e z 


[ ] 


A = 


[ ] 


[ ] 


(Xx.(Xy.[ })e y )e x 


V 

A = 


(Xy.Xz.[ })e y 


Xz.[ } 


[ ] 


A 4 = 


A[A[Xx.A] e x ] 


A[A[Xy.A] e v ] 


A[A[te.A] e z ] 



Now we can define the fourth kind of evaluation context: 

E = ... I i[A[Aa;.I[S[a;]]] E], where A[A] £ A (Final Eval. Context) 

This final evaluation context shows how demand shifts to an argument when a 
function parameter is in demand within the function body. 



3.2 The j3 nee d Axiom and a Derivation 

Figure [2] summarizes the syntax of X n eed as developed in the preceding section^ 
In this section we use these definitions to formulate the j3 axiom for our calculus. 

(Terms) 
(Values) 
(Answers) 
(Answer Contexts) 

(Partial Answer Contexts-Outer) 

(Partial Answer Contexts-Inner) 

(Evaluation Contexts) 

where A[A] G A 

Fig. 2. The syntax and contexts of the new call-by-need A calculus, Xneed- 

Here is the single axiom of X nee d. 

AlA^Xx.AiElx]]] A 2 [v]j = ^[^[A^Kx:^}]]], (/3 need ) 
where A [A] e A 



e — 


x \ Xx.e \ ee 




v — 


Xx.e 




a — 


A[v] 




A = 


[ } | A[Xx.A]e 




A = 


[ ] 1 A[A] e 




V 

A = 


[ ] | A[Xx.A] 




E = 


[ }\Ee\A[E] 


A[A[Xx.A[E[x]]]E] 



1 We gratefully acknowledge Casey Klein's help with the A production. 
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A Pneed redex determines which parameter x of some function is "in demand" 
and how to locate the corresponding argument Ai [v] , which might be an answer 
not necessarily a value. The contexts from the previous section specify the path 
from the binding position (A) to the variable occurrence and the argument. A 
Pneed reduction substitutes the value in A2 [v] for all free occurrences of the func- 
tion parameter — just like in other A calculi. In the process, the function call is 
discarded. Since the argument has been reduced to a value, there is no duplica- 
tion of work, meaning our calculus satisfies the requirements of lazy evaluation. 
Lifting A2 to the top of the evaluation context ensures that its bindings remain 
intact and visible for v. 

Here is a sample reduction in \ nee d, where — > is the one-step reduction: 

((Xx.(Xy.Xz.z y x) Xy.y) Xx.x) Xz.z (1) 

— > (Xx.( Xy.(Xz.z) y x ) Xy.y ) Xx.x (2) 

— > (Xx.(( Xz.z ) Xy.y) x) Xx.x (3) 

— > (Xx.(Xy.y) x) Xx.x (4) 

The "in demand" variable is in bold; its binding A and argument are underlined. 
Line 1 is an example of a reduction that involves a non-adjoined function and 
argument pair. In line 2, the demand for the value of z (twice underlined) triggers 
a demand for the value of y\ line 4 contains a similar demand chain. 



v — Xx.e (Values) 

a a f = A af [v] (Answers) 

A a f= [ ] I (Xx.A a fje (Answer Contexts) 

E a f= [ ] I E af e I A a f[E a f] I (\x.E a f[x}) E af (Evaluation Contexts) 



(Xx.E af [x\) V = E a f[x]{x :=V} (P'need) 

(\x.A a f[v]) ei e 2 = (Xx.A a f[v e 2 ]) e± (lift') 
(Xx.E af [x]) ((Xy.A a f[v}) e) = (Xy.A af [(Xx.E af [x]) v]) e (assoc') 



Fig. 3. A modified calculus, X a f. mo d- 



To furnish additional intuition into f3 nee d, we use the rest of the section to 
derive it from the axioms of X a f. The X a f. mo d calculus in figure [3] combines X a f 
with two insights. First, Garcia et al. |13j observed that when the answers in A a /'s 
lift and assoc redexes arc nested deeply, multiple re-associations are performed 
consecutively. Thus we modify lift and assoc to perform all these re-associations 
in one stepo The modified calculus defines answers via answer contexts, A a f, and 
the modified lift' and assoc' axioms utilize these answer contexts to do the multi- 
step re-associations. Thus programs in this modified calculus reduce to answers 

2 The same modifications cannot be applied to C and A in X m ow because they allow 
earlier re-association and thus not all the re-associations are performed consecutively. 
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^4 a /[u]. Also, the A a f answer contexts are identical to the third kind of evaluation 
context in \ a f-mod and the new definition of E a f reflects this relationship. 

Second, Maraist et al. observed that once an argument is reduced to 
a value, all substitutions can be performed at once. The f3' need axiom exploits 
this idea and performs a full substitution. Obviously f3' need occasionally performs 
more substitutions than deref. Nevertheless, any term with an answer in A a / 
likewise has an answer when reducing with P' need - 

Next an inspection of the axioms shows that the contractum of a assoc! redex 
contains a /3' need redex. Thus the assoc' re-associations and /3' need substitutions 
can be performed with one merged axiomjfl 

(\x.E af [x\) A af [v] = A af [E af [x]{x:=v}] 

The final step is to merge lift' with /3^ eed , which requires our generalized 
answer and evaluation contexts. A naive attempt may look like this: 

A 1 {\x.E[x\] A 2 [v] = A^lElxjix-^v}]] {^ e J 

As the examples in the preceding subsection show, however, the binding oc- 
currence for the "in demand" parameter x may not be the inner-most binding 
A once the re-association axioms are eliminated. That is, in comparison with 
Pneed, fi'need incorrectly assumes E is always next to the binder. We solve this 
final problem with the introduction of partial answer contexts. 

4 Consistency, Determinism, and Soundness 

If a calculus is to model a programming language, it must satisfy some essen- 
tial properties, most importantly a Church-Rosser theorem and a Curry-Feys 
standardization theorem [25]. The former guarantees consistency of evaluation; 
that is, we can define an evaluator function with the calculus. The latter implies 
that the calculus comes with a deterministic evaluation strategy. Jointly these 
properties imply the calculus is sound with respect to observational equivalence. 

4.1 Consistency: Church-Rosser 

The Xneed calculus defines an evaluator for a by-need language: 

eval nee d(e) = done iff there exists an answer a such that Xneed l~ e = a 

To prove that the evaluator is indeed a (partial) function, we prove that the 
notion of reduction satisfies the Church-Rosser property. 

Theorem 1. eval need is a partial function. 

Proof. The theorem is a direct consequence of lemma [T] (Church-Rosser). 
3 Danvy et al. [8] dub a /3" eed redex a "potential redex" in unrelated work. 
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Our strategy is to define a parallel reduction relation for X nee d [5] . Define — > 
to be the compatible closure of a (3 n eed reduction, and — ft to be the reflexive, 
transitive closure of — K Additionally, define => to be the relation that reduces 
Pneed redexes in parallel. 

Definition 1 (=>). 

e e 

A[A 1 [\x.A[E[x]]]A 2 [v]] A'lA'^A'zlA'lE'lx^ix-^v'}}}}, 

ifA[A] G A, e A, A =>■ 

A 2 A' 2 ,A A', E=>E', v v' 
ei e 2 => e[ e' 2 , if ei => e[, e 2 =^ e' 2 
Ax.e Ax.e', if e =^ e' 

The parallel reduction relation =4> relies on notion of parallel reduction for 
contexts; for simplicity, we overload the relation symbol to denote both relations. 

Definition 2 (=> for Contexts). 

[]=M] 

t4i[Ax.A 2 ] e =>• Ai[Ax.A' 2 ] e', i/Ai =>- A[, A 2 A 2 , e e' 
e => e', i/ A => A\ A => A', e => e' 

A[Aa;.I] => A'[Aa:.I'], i/ A => A', I => A' 
Ee^E'e', if E => e ^ e' 

if A^ A', E ^ E' 

A[A[Xx.A[E 1 [x]]]E 2 ] i'[A / [A^.i / [Bi[a;]]]^], 

i =>• i', ^1 £ 2 £ 2 

Lemma 1 (Church- Rosser). If e —ft ei and e — » e2, i/iera i/iere exists a term 
e' such that e\ —ft e' and e 2 —ft e' . 

Proof. By lemma [51 =>■ satisfies a diamond property. Since => extends -ft, —ft is 
also the transitive-reflexive closure of =>, so —ft also satisfies a diamond property. 

Lemma 2 (Diamond Property of =>). Ife=>e% and e =>• e2, £/iere exists e' 
such that e\ => e' and e 2 =>■ e'. 

Proof. The proof proceeds by structural induction on the derivation of e =£- ei. 
4.2 Deterministic Behavior: Standard Reduction 

A language calculus should also come with a deterministic algorithm for applying 
the reductions to evaluate a program. Here is our standard reduction: 

E[e] i — ► E[e'], where e (3 need e' 

Our standard reduction strategy picks exactly one redex in a term. 
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Proposition 1 (Unique Decomposition). For all closed terms e, e either is 
an answer or e — E[e'] for a unique evaluation context E and f3 nee d redex e' . 

Proof. The proof proceeds by structural induction on e. 

Since our calculus satisfies the unique decomposition property, we can use 
the standard reduction relation to define a (partial) evaluator function: 

eval^ r eed (e) = done iff there exists an answer a such that e i — ft a 

where i — ft is the reflexive, transitive closure of i — >. Proposition[T]shows eval^ r eed 
is a function. The following theorem confirms that it equals eva.l neec i- 

Theorem 2. eval need = eval s ^ eed 

Proof. The theorem follows from lemma[31 which shows how to obtain a standard 
reduction sequence for any arbitrary reduction sequence. The front-end of the 
former is a series of standard reduction steps. 

Definition 3 (Standard Reduction Sequences TZ). 

• x C TZ 

• Ax. ex o • • • o Xx.e rn G TZ, if e± o • • • o e m G TZ 

• eo o ex o • • • o e m G TZ, if cq i — > ex and ex o • • • o e m G TZ 

• (exe'i)o---o(e m ex)o(e m e / 2 )o---o(e m e'„) G TZ, ife x o- ■ -oe m , e[o- ■ -oe' n G 1Z. 

Lemma 3 (Curry- Feys Standardization), e —ft e! iff there exists ex o • ■ • © 
e„ G 7?. smc/i that e = e% and e' — e n . 

Proof. Replace —ft with =>s, and the lemma immediately follows from lemma SJ 

The key to the remaining proofs is a size metric for parallel reductions. 
Definition 4 (Size of =>■ Reduction). 

e el =0 



(exe 2 ) =>■ (eie 2 )| = 
Ax.e As.e' = 



ex => ei| + |e 2 
e =*► e'| 



| = 1+|A=*- A'| + |Ax =► Ai| + |^[£7[a:]] =► A'ff^a:]] | + 

|Aa=>^l + #(a;,A'[^[x]]) x|u=^w'| 

w/iere r = A[A X [\x.A[E[x]]] A 2 [v\] => A'[A! 1 [A^[A'[E , [x]]{x:=v r }]]] 
#(x,e) = the number of free occurrences of x in e 

The size of a parallel reduction of a context equals the sum of the sizes of the 
parallel reductions of the subcontexts and subterms that comprise the context. 

Lemma 4. If cq =>■ ex and ex o • • • o e„ G TZ, there exists eo o e\ o • • • o e' p o e n G TZ. 

Proof. By triple lexicographic induction on (1) length n of the given standard 
reduction sequence, (2) I eo => e\ I, and (3) structure of eo0 



We conjecture that the use of Ralph Loader's technique [TS] may simplify our proof. 
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4.3 Observational Equivalence 

Following Morris |22] two expressions e± and e2 are observationally equivalent, 
ei ~ e2, if they are indistinguishable in all contexts. Formally, e\ ~ if and 
only if eval„ ee( i(C[ei]) = eval„ ee( ;(C[e2]) for all contexts C, where 

C= [ ] | Ax.C | Ce | eC (Contexts) 

An alternative definition of the behavioral equivalence relation uses co-induction. 
In either case, X nee d is sound with respect to observational equivalence. 

Theorem 3 (Soundness). If \ n eed r- ei = e-2, then e± ~ e%- 

Proof. Following Plotkin, a calculus is sound if it satisfies Church-Rosser and 
Curry-Feys theorems. 

5 Correctness 

Ariola and Felleisen [3] prove that A a / defines the same evaluation function as 
the call-by-name A calculus. Nakata and Hasegawa [23] additionally demonstrate 
extensional correctness of the same calculus with respect to Launchbury's natural 
semantics |17) . In this section, we show that \ neec i defines the same evaluation 
function as Launchbury's semantics. While our theorem statement is extensional, 
the proof illuminates the tight intensional relationship between the two systems. 

5.1 Overview 

The gap between the X ne ed standard reduction "machine" and Launchbury's 
natural semantics is huge. While the latter's store-based natural semantics uses 
the equivalent of assignment statements to implement the "evaluate once, only 
when needed" policy, the \ nee d calculus exclusively relies on term substitutions. 
To close the gap, we systematically construct a series of intermediate systems 
that makes comparisons easy, all while ensuring correctness at each step. A first 
step is to convert the natural semantics into a store-based machine |27j . 

To further bridge the gap we note that a single-use assignment statement is 
equivalent to a program- wide substitution of shared expressions |10j . A closely 
related idea is to reduce shared expressions simultaneously. This leads to a par- 
allel program rewriting system, dubbed A||. Equipped with A|| we get closer to 
Aneed but not all the way there because reductions in X nee d and A|| are too coarse- 
grained for direct comparison. Fortunately, it is easy to construct an intermediate 
transition system that eliminates the remainder of the gap. We convert X nee d to 
an equivalent CK transition system [5], where the program is partitioned into 
a control string (C) and an explicit context (K) and we show that there is a 
correspondence between this transition system and Aii. 

Figure U outlines our proof strategy pictorially. The four horizontal layers 
correspond to the four rewriting systems. While \ nee d and A|| use large steps to 
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store machine h 

V> J 



, , , . s. go under A P n eed-ck 

CK transitions I s- | s- p s» | 9 



Fig. 4. Summary of correctness proof technique. 



progress from term to term, the machine-like systems take several small steps. 
The solid vertical arrows between the layers figure indicate how mapping func- 
tions relate the rewriting sequences and the dashed arrows show how the smaller 
machine steps correspond to the larger steps of \ nee d and A||: 

• The tp function maps states from the store-based machine to terms in the 
A|| world. For every step in the natural-semantics machine, the resulting 
operation in A|| is either a no-op or a f}\\ reduction, with assignment in the 
store-machine being equivalent to program-wide substitution in A||. 

• Similarly, the £ function maps states of the CK transition system to the 
A|| space and for every CK transition, the resulting A|| operation is also 
either a no-op or a /3m reduction, with the transition that descends under a 
A-abstraction being equivalent to substitution in An. 

• Finally, the <p function maps states of the CK transition system to X nee d 
terms and is used to show that the CK system and Xneed are equivalent. For 
every CK transition, the equivalent \ ne ed operation is either a no-op or a 
ftneed reduction. 



Syntax 



S L = (e,F L s,r) (States) 

Fls = Fl,. . . (List of Frames) 

Fl = (arg e) | (var x) (Frames) 

r = (xM>e, . . .) (Heaps) 



Transitions 



ckh 

(ei e 2 , F L s, F) 1 — > (ei, ((arge 2 ), F L s),r) (push- arg- ckh) 

(\x.ei, ((arge2), Fls), F) (ei{x :=y}, Fls, (F, i/m^)) , y fresh (descend-lam-ckh) 

(x, Fls, (F, x i—¥ e)) ^ (e,(( var x), Fls), F) (lookup-var-ckh) 

(v, ((var x),Fls),T) v— \ (v, Fls, (F, x h- v)} (update-heap- ckh) 



Fig. 5. The natural semantics as an abstract machine. 
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Subsections 15.21 and 15.31 present the store-based machine and the parallel 
rewriting semantics, respectively, including a proof of equivalence. Subsection l5.4l 
presents the CK system and subsection 15.51 explains the rest of the proof. 

5.2 Adapting Launchbury's Natural Semantics 

Figure [5] describes the syntax and transitions of the store machinel^It is dubbed 
CKH because it resembles a three- register machine [5]: a machine state Sl is 
comprised of a control string (C), a list of frames (K) that represents the control 
context in an inside-out manner, and a heap (H). The . . . notation means "zero of 
more of the preceding kind of element." An (arge) frame represents the argument 
in an application and the (var x) frame indicates that a heap expression is 
the current control string. Parentheses are used to group a list of frames when 
necessary. The initial machine state for a program e is (e, (), ()). Computation 
terminates when the control string is a value and the list of frames is empty. 

The push-arg-ckh transition moves the argument in an application to a new 
arg frame in the frame list and makes the operator the next control string. 
When that operator is a A-abstraction, the descend-lam-ckh transition adds its 
argument to the heapH mapped to a fresh variable name, and makes the body of 
the operator the new control string. The lookup -var-ckh transition evaluates an 
argument from the heap when the control string is a variable. The mapping is 
removed from the heap and a new (var x) frame remembers the variable whose 
corresponding expression is under evaluation. Finally, when the heap expression 
is reduced to a value, the update-heap- ckh transition extends the heap again. 

5.3 Parallel Rewriting 

The syntax of the parallel A-rewriting semantics is as follows: 

e || = e | e\\ x (Terms) 
ui = v | u\ x (Values) 
£j| = [ ] | £j| e || | E\\ x (Evaluation Contexts) 

This system expresses computation with a selective parallel reduction strategy. 
When a function application is in demand, the system substitutes the argument 
for all free occurrences of the bound variable, regardless of the status of the 
argument. When an instance of a substituted argument is reduced, however, all 
instances of the argument are reduced in parallel. Here is a sample reduction: 

(Xx.Xx) {II) (I I) X {I I) X l-^U I X I X h-^U I 

5 To aid comparisons, we slightly alter Launchbury's rules (and the resulting machine) 
to use pure A terms. Thus we avoid Launchbury's preprocessing and special syntax. 

6 The notation (T, x^e) is a heap F, extended with the variable-term mapping i^e. 
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The Am semantics keeps track of arguments via labeled terms eii x , where 
labels are variables. Values in A|| also include labeled A-abstractions. Reducing 
a labeled term triggers the simultaneous reduction of all other terms with the 
same label. Otherwise, labels do not affect program evaluation. 

We require that all expressions with the same label must be identical. 

Definition 5. A program is consistently labeled (CL) when for any two sub- 
terms ep 1 and e^ 2 0/ en, x\ = x 2 implies ein = e»2- 

In the reduction of A|| programs, evaluation contexts £j| determine which part 
of the program to reduce next. The Am evaluation contexts are the call- by-name 
evaluation contexts with the addition of the labeled E^ x context, which dictates 
that a redex search goes under labeled terms. Essentially, when searching for a 
redex, terms tagged with a label are treated as if they were unlabeled. 

The parallel semantics can exploit simpler evaluation contexts than \ nee d 
because substitution occurs as soon as an application is encountered: 



£j|[((Ax.e||i)*0e|| 2 ] 



£jl [e||], if [ ] is not under a label in £j| 

F\\[e^tz^F\\ 2 [e^, ii E\[ )=L\ 1 [{L\ 2 \ ])*] (/3,|) 
and [ ] is not under a label in £jp 
where en = ein{x:=eii2},w fresh 



On the left-hand side of /3«, the program is partitioned into a context and a 
/3-like redex. A term e v may have any number of labels and possibly none. On 
the right-hand side, the redex is contracted to a term eu\{x -—e^} such that 
the argument is tagged with an unique label w. Obsolete labels y are discarded. 

There are two distinct ways to contract a redex: when the redex is not under 
any labels and when the redex occurs under at least one label. For the former, 
the redex is the only contracted part of the program. For the latter, all other 
instances of that labeled term are similarly contracted. In this second case, the 
evaluation context is further subdivided as £j| [ ] = £j|i[(£j|2[ ]) z ], where z is the 
label nearest the redex, i.e., E12 contains no additional labels. A whole-program 
substitution function is used to perform the parallel reduction: 

eilifc^eiiJ- = ef 

e||Tfe^e||S = (e||ife^e||]f) x , x^y 
(Aa;.e||i){a;-<=e||J = \x.{e\\i$x-$=e$) 
(e||ie|| 2 ){a;^=e||J = (e||i|a;^=e|||e||2|a;^=e|||) 
otherwise, e||i§x<=e||}j- = ep 

Rewriting terms with preserves the consistent labeling property. 
Proposition 2. If eu is CL and e« h-!U e«' , then e«' is CL. 
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The i\) function reconstructs a Aii term from a CKH machine configuration: 



1 p((e,((v a xx),F L s),r))=iP((x,F L s,(x^e,r))) | j> : S L ->■ e M 
V((ei, ((arge 2 ), F L s), r)) = ^((e x e 2 ,F L s, r» 

^((e,(),r)) = efrj 

The operation e§_T§, using overloaded notation, replaces all free variables in e 
with their corresponding terms in r and tags them with appropriate labels. 
Lemma [5] demonstrates the bulk of the equivalence of the store machine and 
The rest of the equivalence proof is straightforward [9] . 

Lemma 5. If (e, Fj,s, r) h— ^ {e 1 , Fjjs' , T') , then either: 

1. ip({e,F L s,r))=Tp((e',F L s',r')) 

2. ip((e,F L s,r))Jkip({e',F L s',r>)) 



5.4 A Transition System for Comparing \ nee d and Ay 

The CK layer in figure 2] mediates between Aii and X ne ed- The corresponding 
transition system resembles a two-register CK machine [9] . Figure [6] describes 
the syntax and the transitions of the system!! 



Syntax 



S = (e, Fs) 
Fs = F,... 

F — (arg e) | (lam x) | (bod x Fs Fs) 



(States) 
(List of Frames) 
(Frames) 



Transitions 



(ei e 2 , Fs) ^ (ei, ((arg e 2 ), Fs)) 
{Xx.e, Fs) i ck 



(push-arg-ck) 
(descend-lam-ck) 



■ (e, ((lama;),Fs)) 
if balance (Fs) > 

ck 

(x, (Fsi, (lama;), Fs2, (arg e), Fs)) h— ► (e, ((bod a; Fsi i*S2], Fs)) (lookup-var-ck) 
if #(Fsi) £ A[E],(j) F (Fs2) £ A,0 F (Fs) £ £ A 



(«, (Fs 3 , (bod a; F Sl Fs 2 ), Fs)) > 



•(^(Fsijx—^Fsa,^,^)) 
if <^(Fs 3 ) £ A 



(A. 







Fig. 6. A transition system for comparing \ nee d and A 



States consist of a subterm and a list of frames representing the context. The 
first kind of frame represents the argument in an application and the second 

7 The lemma relies on an extension of the typical a-equivalence classes of terms to 
include variables in labels as well. 

8 The CK transition system is a proof-technical device. Unlike the original CK ma- 
chine, ours is ill-suited for an implementation. 
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frame represents a A-abstraction with a hole in the body. The last kind of frame 
has two frame list components, the first representing a context in the body of 
the A, and the second representing the context between the A and its argument. 
The variable in this last frame is the variable bound by the A expression under 
evaluation. The initial state for a program e is (e, ()}, where () is an empty list 
of frames, and evaluation terminates when the control string is a value and the 
list of frames is equivalent to an answer context. 

The push-arg-ck transition makes the operator in an application the new 
control string and adds a new arg frame to the frame list containing the ar- 
gument. The descend-lam-ck transition goes under a A, making the body the 
control string, but only if that A has a corresponding argument in the frame list, 
as determined by the balance function, defined as follows: 



balance(i<S3, (bod a; Fs\ Fs-z), Fs) = balance(i*S3) 

balance(-Es) = #arg-f raiaes(Fs) — #lam-f rames(Fs) 
Fs contains no bod frames 

The balance side condition for descend-lam-ck dictates that evaluation goes 
under a A only if there is a matching argument for it, thus complying with the 
analogous evaluation context. The balance function employs #arg-f rames and 
#lam-f rames to count the number of arg or lam frames, respectively, in a list 
of frames. Their definitions are elementary and therefore omitted. 

The lookup- var-ck transition is invoked if the control string is a variable, 
somewhere in a A body, and the rest of the frames have a certain shape consistent 
with the corresponding parts of a (3 nee d redex. With this transition, the argument 
associated with the variable becomes the next control string and the context 
around the variable in the A body and the context between the A and argument 
are saved in a new bod frame. Finally, when an argument is an answer, indicated 
by a value control string and a bod frame in the frame list — with the equivalent 
of an answer context in between — the value gets substituted into the body of the 
A according to the /3 neec i- c k transition. The fi ne ed-ck transition uses a substitution 
function on frame lists, Fs{x :=e}, which overloads the notation for regular term 
substitution and has the expected definition. 

Figure [7] defines metafunctions for the CK transition system. The <j) function 
converts a CK state to the equivalent X nee d term, and uses <Pf to convert a list 
of frames to an evaluation context. 

Now we can show that an evaluator defined with i—t is equivalent to eval^ r eed . 
The essence of the proof is a lemma that relates the shape of CK transition 
sequences to the shape of \ nee d standard reduction sequences. The rest of the 
equivalence proof is straightforward [9] . 

Lemma 6. If (e,Fs)i—^(e',Fs'}, then either: 



balance : Fs 




1. cf>((e,F S )) = <t>((e>,Fs')) 

2. 0((e,Fs))^0((ei,Fs')) 
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S- 



4>{{e,Fs)) = MFs)[e 



Fs 



MO) = [ } 

(j) F ((lam x),Fs) = <j> F (Fs)[\x.{ ]] 

M(^ge),Fs) = MFs)[[ ]e] 
fa ((bod x Fsi Fs 2 ) , Fs) = 

MFs)[MFs2)[Xx.MFs 1 )[x]] [ ]] 



£((e,Fa}) = Z F (Fs,e) 



£f : Fs x eii 



^(((arge||i),i%),e||) = £F(-Fs,ey eyi) 
^(((boda;aii^2),Fs),e||) = 

((Fsi , (lam x) , Fs 2 , (arg ey), Fs) , x) 
( F (((lamx),Fsi, (arge||i),Fs 2 ),e||) = 

fr((Fs 1 ,Fs 2 ),e ll {x:=e\ l y 1 }) 
<fa(Fsi) € A, y fresh 



Fig. 7. Functions to map CK states to \„ e ed (<t>) and Ay (£). 



Finally, we show how the CK system corresponds to Am. The £ function 
defined in figure [7] constructs a An term from a CK configuration. 



Lemma 7. If (e, Fs) <—$■ {e' , Fs'} , then either: 
1. Z((e,Fs))=£((e',Fs')) 



5.5 Relating all Layers 

In the previous subsections, we have demonstrated the correspondence between 
Am, the natural semantics, and the X nee d standard reduction sequences via lem- 
mas [5] through [7] We conclude this section with the statement of an extensional 
correctness theorem, where eval natura ; is an evaluator defined with the store ma- 
chine transitions. The theorem follows from the composition of the equivalences 
of our specified rewriting systems. 

Theorem 4. eval need = eval natural 



6 Extensions and Variants 

Data Constructors Real-world lazy languages come with data structure con- 
struction and extraction operators. Like function arguments, the arguments to 
a data constructor should not be evaluated until there is demand for their val- 
ues [12114] . The standard A calculus encoding of such operators l5j works well: 

cons = Air. Ay. As. s x y, car = Xp.p Xx.Xy.x, cdr = Xp.p Xx.Xy.y 

Adding true algebraic systems should also be straightforward. 



The Call-by-need Lambda Calculus, Revisited 



19 



Recursion Our X nee d calculus represents just a core A calculus and does not 
include an explicit letrec constructor for cyclic terms. Since cyclic programming 
is an important idiom in lazy programming languages, others have extensively 
explored cyclic by-need calculi, e.g., Ariola and Blum pQ, and applying their 
solutions to our calculus should pose no problems. 

7 Conclusion 

Following Plotkin's work on call-by-name and call-by-value, we present a call- 
by-need A calculus that expresses computation via a single axiom in the spirit of 
p. Our calculus is close to implementations of lazy languages because it captures 
the idea of by-need computation without retaining every function call and with- 
out need for re-associating terms. We show that our calculus satisfies Plotkin's 
criteria, including an intensional correspondence between our calculus and a 
Launchbury-style natural semantics. Our future work will leverage our X nee d 
calculus to derive a new abstract machine for lazy languages. 
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